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Abstract 

A truncation of the overlap (domain wall fermions) is studied and a criterion for 
reliability of the approximation is obtained by comparison to the exact overlap formula 
describing massless quarks. We also present a truncated version of regularized, pure gauge, 
supersymmetric models. The mechanism for generating almost masslessness is shown to be 
a generalized see-saw which can also be viewed as a version of Froggatt-Nielsen's method 
for obtaining natural large mass hierarchies. Viewed in this way the mechanism preserving 
the mass hierarchy naturally avoids preserving even approximately axial U(l). The new 
insights into the source of the mass hierarchy suggest ways to increase the efficiency of 
numerical simulations of QCD employing the truncated overlap. 
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1. Introduction 

For massless quarks, QCD would have exact chiral symmetries. This simple observa- 
tion explains a large body of observations. At present we try to solve QCD by numerical 
methods using a lattice regularization. In the standard approach exact chiral symmetries 
cannot be preserved by this regularization. This is disappointing but not debilitating since 
chiral symmetries can be restored in the continuum limit. Still, there have been many 
attempts to get exact chiral symmetries on the lattice, even just as a matter of principle. 
These attempts often led to controversies and the issue is subtle. Recent progress seems to 
have been achieved, and, although controversy still exists on related issues, my impression 
is that most workers would agree that there does exist a well defined, albeit non-standard, 
way to preserve global chiral symmetries exactly on the lattice. The price is quite high 
though: The lattice model, although well defined, must be interpreted as containing an 
infinite number of fermions fields. Moreover, the expressions are complicated and usage in 
practice appeared, only a year or two ago, quite unlikely. However, recent developments 
raise the hope that this could change. 

In a recent publication [1] a substantially simpler formula for the effective action in 
lattice vector-like theories with exact global chiral symmetries was derived. It is based on 
the overlap which was developed [2,3,4] as a method to regulate chiral gauge theories on 
the lattice. Obviously, as such, the overlap must contain the vector-like as a particular 
case case, where the chiral symmetries are not gauged. Section 9 of [4] contains a specific 
discussion of the properties of the overlap in the vector-like context. What is new in [1], 
is that the expressions in [4] can be simplified. 

Let us first briefly review the basic features of the overlap relevant to the present 
context. (Although the focus is on four dimensions we shall try subsequently to write most 
equations in an arbitrary even dimension d.) Formally, in the continuum path integral, 
because the fermions enter only bilinearly in the action, one can write any correlation 
function as an average over gauge field configurations of an object obtained by integrating 
first over the fermions. This Grassmann integration produces a result factorized into two 
types of terms: A determinant (the exponentiated sum of vacuum fermion diagrams) and a 
combination of entries of inverse Dirac operators (propagators) . In the regularized overlap 
the information contained in fermion vacuum diagrams is stored in two (or one, see later) 
ground states of two auxiliary quantum mechanics problems, parametrically dependent on 
the gauge fields. The propagators are obtained by matrix elements of certain fermionic 
creation/annihilation operators between the two ground states. The crucial point is that 
each chiral component of a physical field is represented by a separate set of such operators. 
This implies immediately exact chiral symmetries, as the factorization is exact and the 
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system acts as if it had a simple bilinear fermionic action. The ground states also factorize 
into direct products of one factor for each chiral component. However, when the gauge 
field background carries nontrivial topology, the ground states, which for perturbative fields 
are singlets under the global chiral group, carry nontrivial charges. By this mechanism 
global anomalous conservation laws behave as expected (i.e. the respective charges are 
not conserved, in spite of the formal decoupling in the action). This property is needed in 
QCD, as is well known. 

In any regularization, as long as the fermionic action is bilinear, the chiral components 
either decouple or not. If we have a lattice model with a finite number of fields per unit 
Euclidean volume one cannot have exact chiral symmetries without this decoupling and 
one cannot get the violations of anomalous conservation laws if one has exact decoupling. 
The overlap's way out is to be equivalent to a system containing an infinite number of 
fermions (heavy flavors). 

Recently, a particular truncation of the number of heavy flavors has been applied to 
numerical QCD [5] in four dimensions, to the two dimensional two flavor Schwinger model 
[6,7] and compared to the overlap in [7]. The fermions in this truncation are typically 
referred to as domain wall fermions. This terminology is a residue of a very influential 
paper by Kaplan [8] , who started this whole subset of activity in lattice field theory. The 
fermions used in [5], [6] and [7] are also sometimes referred to as Shamir fermions [9,10]. 
They are slightly different from the original "Kaplan fermions" in that they correspond 
to a particular limiting case where a certain unimportant free mass parameter is taken 
to infinity [11]. This makes one of the two ground states needed for the overlap trivial 
and independent of the gauge field background, leaving only the other ground state as the 
carrier of all the information typically residing in the closed fermion loop vacuum diagrams. 

The main approximating feature of these systems is that one uses only a finite number 
of fermions and nevertheless expects to get an essentially chirally symmetric theory. As 
emphasized in [7] it is important then to compare carefully the truncated version to the 
overlap. This was done numerically in [7] for a toy model. On the other hand, recent 
numerical work in the truncated model [5] for QCD produced promising results. The 
simplicity of the main formula in [1] indicates that the comparison first undertaken in [7] 
at the numerical level can be attempted also at the analytical level. In view of the work 
in [5] such an analytical comparison is needed, given the difficulty to simulate the overlap 
directly in four dimensions. This leads us to the purpose of this paper, namely, to improve 
our understanding of the nature of the approximation introduced by the truncation and 
of its limitations. 
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Figure 1 Schematic phase diagram for the overlap. In the triangular area we have 



exact chiral symmetries. The possibility for a continuing line from the 
point A is indicated. Along that line one of the mesons could be massless, 
but exact chiral symmetry is not necessarily restored there. The phase 
diagram could be much more complicated. In the continuum limit one 
would approach the interior of the segment BC for strictly massless quarks. 
To describe a continuum theory with a positive massive quark the endpoint 
C should be approached from the outside. Immediately to the left of line 
AB d doublers become massless, while the state associated with the origin 
of momentum space becomes heavy. 



To get a visual image for what the overlap does and how the truncations approximate 
that consider the three schematic phase diagrams in figures 1,2,3. The structure of the 
phase diagrams is basically guessed and the guess is quite incomplete by itself. Probably, 
numerical simulations of QCD would benefit from an investigation of the phase diagram 
as a whole, at least in the regime of /3-gauge couplings that are practically relevant. 

The continuum limit is obtained as (3 is taken to infinity. When f3 is small the link 
matrices fluctuate strongly, lattice effects are important and the whole concept of chirality 
looses its meaning, m is a parameter that controls the dominating correlation length 
among the fermions when the gauge forces are turned off. The light fermion has a mass 
that decreases with m in that case. When gauge interactions are turned on we choose to 
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Figure 2 Schematic phase diagram for the truncated overlap, with a large number 
of heavy flavors. In the triangular area we have approximate chiral sym- 
metries, with the approximation improving towards the center. This is 
indicated by the gradual change of shades. To the left of the line AB one 
also expects approximate chiral symmetries but they are accompanied by 
an unwanted increase in the number of Dirac copies. On the center line one 
of the "pion" states becomes massless, but, exact chiral symmetries exist 
only at the origin. 

preserve CP invariance and keep m real. In the continuum the physics for positive and 
negative m could differ substantially. 

In the overlap (figure 1) there is a region where one has exact chiral symmetry at finite 
lattice spacing. To simulate numerically QCD one only needs chiral symmetries (ignoring 
for the moment the nonzero light quark masses) in the continuum limit, that is at f3 = oo. 
The advantage of having chiral symmetry at finite (3 is that the approach to continuum 
is faster as the leading lattice scaling violations come, in the case of Wilson fermions, 
from operators of dimension five, which also break chiral symmetries. Eliminating chiral 
symmetry breaking also eliminates these operators. In contemporary parlance the overlap 
provides automatic "nonperturbative 0(a) improvement". In practice this has been seen 
to work in two dimensions (see Fig 1 of [12]; footnote 3 on page 110 there notes the 
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Figure 3 Schematic phase diagram for the truncated overlap with a small number of 
heavy flavors. The triangular area where we have approximate chiral sym- 
metries has shrunk relatively to figure 2. The mass of the quarks decreases 
towards the center, but is much higher than the mass in figure 2. On the 
center line one of the "pion" states becomes massless, but exact chiral sym- 
metries exist only at the origin. The shrinkage of the triangle indicates the 
developing need for more and more accurate mass tuning as the number of 
heavy flavors is decreased and the regular Wilson case is approached. 

importance for QCD). 

When f3 is decreased, the links behave more or less like rll where r is a positive real 
number less than one, decreasing towards zero, and U is unitary. This induces a reduction 
in the range of masslessness, until, it is conjectured, the range shrinks to zero at the point 
A. For coupling constants f3 below /3a there are no massless quarks any more. The mesons 
are likely also all heavy, except the possibility of a Wilson critical line. One manifestation 
of the regime above -k- is the absence of instantons as detected by the overlap fermions [4] . 
Pa is not known at present, but, for SU(2) in the quenched approximation, it is apparently 
smaller than values of f3 of numerical interest [13]. Therefore, usage of the overlap appears 
viable in four dimensions (QCD) even with presently available computing power. 

To the left of the line AB one expects different sets of degenerate doublers to become 
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massless. At k = oo there likely are more transition lines there. Our discussion below 
ignores this region of parameter space as it is quite unclear whether it would be of practical 
use in simulations. 

With the truncation, exact chiral symmetries are lost and dimension five operators 
come back in. For large enough numbers of heavy flavors the quarks get small masses, and 
the coefficients of the dimension five operators are likely small numbers. There no longer 
are sharp demarcation lines connecting A to B and A to C. These lines are replaced by 
crossovers. However, a new critical line appears connecting A to the origin. This is just 
the ordinary Wilson critical line. As the number of heavy flavors is further reduced the 
standard Wilson situation is approached. 

No claim is made that the above sketches are completely correct. It is hoped that they 
do capture some essential features that we should keep in mind when reading the rest of 
the paper. In the next section the effective action will be discussed and some new results 
will be presented. The technical device consists of some determinant formulae derived 
in the appendix. The following section presents various explanations of the mechanism 
that keeps the regulated theory close to a chiral limit. The final section contains some 
conclusions and suggestions for further research. 



2. Effective Action 

Using some manipulations on determinants we derive an expression for the effective 
action induced by integrating over all fermions in the truncated model. Our objectives 
are: 

• Make no direct use of operator formulae. The introduction of the auxiliary Hilbert 
space is necessary in the infinite flavor case [3]. It has been reused in [10] for the 
truncated case where it is not necessary, and obscures the simple fact that all we are 
doing is studying lattice QCD with several flavors mixed in a certain way. Once we 
understand this it should come as no surprise that the mechanism for suppressing one 
of the quark masses is well known in ordinary continuum field theory and conceptually 
requires no extra dimensions. 

• We wish to make direct contact with the exactly massless case [1] and see how close 
we would get to it for typical gauge backgrounds. 

• The formulae provide the starting point for finding expressions for the fermion corre- 
lation functions. 

• The overlap formulation is related to the path integral one by a subtraction removing 
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the effects of most of the heavy quarks [2,3]. The expression for the effective action in 
[1] includes this subtraction. It is crucial to carry the subtraction out correctly if one 
wants to reproduce instanton effects. The subtraction of [2] was mentioned in [9] but 
was viewed as unessential. The adaptation of the prescription of [2] to the truncated 
case in [10] is not quite right, as first pointed out in [7]. We show explicitly that the 
subtraction in [7] is the most natural one. 
• The expressions we arrive at will allow the introduction of two kinds of mass terms, 
and will establish that a proposal to look at the spectrum of the operator 1 + V in [1] 
could indeed show how spontaneous chiral symmetry breakdown occurs, separating 
cleanly this effect from those of nonzero global gauge field topology. 
In [2] an infinite mass matrix was introduced which was shown to be equivalent to 
Kaplan's [8] fifth dimension formulation. There were two parameters (a± in the notation 
of [2]) of opposite signs that were needed. While one was bounded, the other was not, 
and could be taken to infinity. This results in a simplification [11] and is equivalent to 
so called "open boundaries" from the fifth dimensional viewpoint. These open boundaries 
have appeared in the literature before, in the present context in [14], but, according to [15] 
also much earlier [16]. As an application to vector like theories open boundaries were first 
employed in [9] and [10]. 

The fermionic actions we shall look at are of the form: 



2k 



(2.1) 



s=l 



The fermionic fields, $ s , have the following left-right structure: 
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(2.2) 



xf' L are left or right Weyl fermions in the notation of [3]. Similarly one defines $ s . Our 
convention is that vector gauge interaction appear diagonal in D. We suppress all space- 
time, spinorial and gauge indices, displaying explicitly only the left-right character and 
flavor. 

The lattice is taken to have L d sites. Our basic building blocks in the matrix will have 
size q x q where, in d dimensions (depending on context, we shall often implicitly assume 
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d = 4) q = 2 a l n c L d . n c is the dimension of the gauge group representation (n c 
QCD). Following [3] we write: 
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(2.3) 



The matrix D is of size 2k x 2k where the entries are q x q blocks. 

The matrices B and C are dependent on the gauge background defined by the collec- 
tion of link matrices U^{x). These matrices are of dimension n c x n c . \i labels the positive 
d directions on a hypercubic lattice and U^(x) is the unitary matrix associated with a link 
that points from the site x in the /i-direction. 



d d 
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(Bo)xai,y/3j ~ 77 <W t 2 ^ 
/*=1 
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y ,x+ jX\U ix\ x j) ij 
(B) xai,yf3j (Bo)xai,y/3j ~(~ -^0^xai,y/3j 



>x,y+{iy u fx 



(Ul(y)) 



(2.4) 



The indices a,/? label spinor indices in the range 1 to 2a -1 . The indices i,j label color in 
the range 1 to n c . The Euclidean 2^ x 2^ Dirac matrices 7 M are taken in the Weyl basis 
where their form is 



7m = 



(J, 







(2.5) 



Of particular importance is the parameter M . As long as M > the matrix B 
is positive definite due to the unitarity of the link variables. To make almost massless 
quarks on the lattice one also wants [8, 2] |M | < 1. (Note that the notational conventions 
adopted here are slightly different from [8] and [2]: The parameter M often appears as 
1 — mo and the parameter m in figures 1,2,3 although meant there more generically, is 
just Mq here.) Although several of the manipulations require that B be nonsingular, and 
therefore one would restrict M to the interval (0, 1) (this is probably an overkill) the final 
expressions are meaningful for the entire range (—1,1). 
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It is convenient to introduce the 2q x 2q matrix r^+i representing the regular 7^+1 
matrix on spinorial indices and unit action on all other indices. In terms of q x q blocks 
we have 



1 
-1 



Setting X = Y = in eq. (A. 15) we get: 



detD = (-) qk (detB) k det 



1 -r 



d+l 



i + r 
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Writing 



we obtain 



T = e 
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° B 



C^C + B 



det D = (-) qk (det B) k det [l + e kH ] det 



1 + Td+l 

tanh(ftf) 



(2.6) 



(2.7) 



(2.1 



(2.9) 



Comparing to the overlap formula in [1] , we conclude that we want the subtraction to 
remove all factors but the last in the above equation. As we shall see below, the large k limit 
is then precisely given by the overlap. The factors we wish to cancel out exactly correspond 
to the determinant induced by integrating over the fermions of a system identical to the 
ones we dealt with up to now, only that the boundary conditions at s = 1 and at s = 2k 
have to be chosen as anti-periodic. This should come as no surprise, since the factor 
det [l + e kH ~\ above clearly corresponds to the trace of the transfer matrix T and a trace 
is implemented by anti-periodic boundary conditions when the integration variables are 
Grassmann. 

The subtraction is handled by adding 2k pseudo- fermions, i.e. fields that have identical 
index structure to fermions, only their statistics is assumed to be of Bose type. The pseudo- 
fermions are coupled by a matrix D p ? . If the subtraction is implemented in a Monte Carlo 
simulation it will be important that the pseudo-fermion determinant not change sign as a 
function of the gauge background. 
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Setting X = Y = 1 in equation (A. 15) we get: 



det D pf = (-) qk (det B) k det [l + e kH ] 



(2.11) 



Sign changes would be avoided if B is a positive matrix. 

We now obtain the effective action for the truncated model: 



detD 
det DPf 



= det 



l + r d+1 tanh(f#) 



(2.12) 



To connect to the overlap we let k — > oo and obtain: 



det£> 
det DPf 



det 



1 + T d+1 e(H) 



(2.13) 



This formula is identical in structure to the main result of [1]. There is a difference 
though: Here H is more complicated and not strictly local. The difference reflects the 
usage of discrete flavor here as opposed to continuous flavor in [1] . With continuous flavor 
H simplifies significantly. In a simulation however, discrete flavor is more appropriate. On 
the basis of the above we can also write down the effective action for the truncated system 
in the continuous flavor case: 



detD 
det DPf 



det 



1 + T d+1 tanh(AiT) 



(2.14) 



The parameter A represents the finite range of continuous flavor. In the overlap, A is taken 
to infinity. The matrix H' above is simply related to a Wilson-Dirac lattice operator, X: 



T d+1 H' = X= (_^ f 



(2.15) 



The block B' is the same as B introduced before, only the parameter M is taken to 
the range (—2,0). Therefore, for continuous flavor, we deal only with negative masses, 
while, for discrete flavor, from the d- dimensional point of view, we can restrict ourselves to 
positive Mo only. From the d + 1-dimensional point of view one sometimes adopts if one 
thinks in terms of domain walls one would say that one always has a negative mass term. 
But, as far as questions of positivity go, the d + 1 terminology is misleading. 

Turning back to discrete flavor we arrive at a criterion for when the truncated expres- 
sion is a good approximation to the exactly massless system: 



k » max 

t 



1 



log\t\ 



(2.16) 
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The maximum is taken over all eigenvalues of T, t, where T is given by equation (2.8). As 
long as the gauge configurations are smooth in the gauge invariant sense, one expects a gap 
in the spectrum of T around 1 and the criterion is not very restrictive. For large enough 
gauge coupling (3 one expects such configurations to dominate. Thus, if we had a method 
of generating gauge configurations, each one from scratch and correctly distributed, the 
criterion might end up to be satisfied in practice for all configurations with a reasonably 
small k (in the last section some suggestion for lowering the needed k even further are 
made). But, the real life simulation methods are based on a walk in the space of gauge 
configurations. Thus, gauge configurations evolve at some rate in the space. This evolution 
has to produce, with the correct probability, configurations that approximate continuous 
backgrounds carrying nontrivial topological charge. During the evolution between two 
configurations that carry different topological charge the gauge fields must pass through 
points where they are very different from a smooth background in the sense that they have, 
at some location, a structure that would be interpreted in the continuum as a singularity. 
In the vicinity of those configurations the matrix T must have one eigenvalue at least that 
is very close to unity. This is so because the number of eigenvalues of T that is smaller 
than unity changes when one goes from a configuration that carries one topological charge 
to a configuration that carries another (the geometric mean of the eigenvalues of T is fixed 
by detT = 1). This is a problem noted in [7] on the basis of experimentation with the two 
flavor Schwinger model. While it is essentially an algorithmic problem, it is an old one, so 
an immediate clean resolution is not very likely. 

For any finite k the partition function never vanishes and, actually, stays positive. 
In the infinite k limit robust zeros appear in instanton backgrounds. For this, accurate 
subtraction is essential. The interpretation of the finite k system is that it contains k — 1 
heavy quarks and one light quark (we ignore the heavy doublers here). The light quark 
is almost massless when k is large. The mass of the light quark is positive and vanishes 
exponentially as k increases. 

Although the light fermion has a small mass for any finite k, one may want to add yet 
another mass term, fi. In particular, one may wish to study what happens when the mass 
is allowed to go negative (more precisely, what happens when, for small fi, the combination 
fie ld goes negative - here 9 is the famous theta-parameter) . 

The simplest way to add another mass parameter is to add a direct coupling between 
the left and right components of the would be massless quark. The most natural interpo- 
lating fields for the massless quark were defined in [2] : They are the fields at the "defect" in 
Kaplan's picture. The appropriate mass term was introduced in [4] and shown to have (as- 
suming its sign is positive) the needed properties to ensure Nussinov-Weingarten-Witten 
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mass inequalities [17] at the regularized level. 

To identify the left and right components of the would be massless quark in this case 
is easy: Set B = in the expression for D. It is evident then that one has at s = 1 a 
massless right handed fermion and at s = 2k a massless left handed fermion. In addition 
there are k — 1 massive Dirac fermions. Actually, every one of the Dirac fermions comes 
in 2 d copies because of lattice doubling. B takes care of the extra massless 2 d — 1 copies 
and makes them heavy. B also couples the two remaining light Weyl fermions, but the 
coupling is indirect and only a small mass is generated. This mass vanishes as k — > oo. To 
maintain a mass also in this limit, similarly to [4] we introduce fx via the matrices X and 
Y of the appendix. 

The new operator acting on the fermions is then D(fS) given by: 
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(2.17) 



Clearly, the pseudo-fermion operator, D p f is just D{ji = 1). Therefore, the subtraction 
simply removes k massive Dirac fermions (not counting doublers). Note that one removes 
k heavy particles although the original system had only k — 1 heavy particles. In this sense, 
the remaining system can be thought of as "doubly" regularized: in addition to the lattice 
one also has a Pauli-Villars regulator. Here, we draw a distinction between the "latticy" 
concept of a pseudo-fermion and the continuum concept of a Pauli-Villars one. In [9, 10, 
7] any fermion with wrong statistics is referred to as a Pauli-Villars fermion. These papers 
also include a /U-mass term. 

Manipulations similar to ones already employed yield: 



det D(fi) = (-) 9fc (det B) k det [l + e kH ] (1 + /u) q det - 



l + r d+1 ^-^tanh 
1 + [i 



2 



(2.18) 

Clearly, we take \fi\ < 1 with < \x < 1 representing positive masses and — 1 < ji < 
representing negative masses, where, by definition, we take 9 = 0. Note that, just 
like in [4], the subtraction of pseudo-fermions does not depend on \i. This is important, 
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since the determinant associated with the subtraction must have a definite sign, while the 
determinant of the original fermions must change sign for negative masses when the gauge 
fields change topology by one unit. Let us remark that using the general matrices X and Y 
of the appendix one can similarly derive left-right correlation functions for the light quark 
by differentiation. This bypasses the need for operators. 

To see the relation between topology and mass sign come out we ought to dispense 
with the other source of finite mass, namely k. We go to the overlap then, setting k = oo, 
and obtain: 



d(fi) = 



D{/m) 



D(ji = 1) 
Assuming det H ^ we obtain: 



fc=oo 



= det^[l + /j + (l-/j)T d+1 e(H)}. 



(2.19) 



d(fj) =det \T d+1 e(H)]d(-n). 



(2.20) 



In [3, 1] we defined the topological charge n top as half the difference between the number 
of positive and negative eigenvalues of H. This implies 



(2.21) 



confirming the interpretation of the sign of fi as the sign of the physical mass. Similarly, 
one could treat complex masses, or (in the case of several light flavors) mass matrices. 
The simplicity of the formula raises the hope of possible lattice investigations of QCD at 

9 = 7T. 

We immediately learn now how to add a mass term to the continuum flavor case where 
fx is the single source for a mass: all we need to do is use in d(/j,) the simpler matrix in 
[1] which plays the role of H here (we denoted this matrix by H' in (2.14-15)). Also, with 
V = r<i+i€(H'), we see, by taking derivatives with respect to \i at /j, = 0, that spontaneous 
chiral symmetry breakdown should indeed be found in the spectral properties of 1 + V, 
establishing the validity of a conjecture in [1]. 

Indeed, for Nf degenerate flavors / we have: 
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Nf 



N f 

/=i 



physical ^ = ^ det ^ 



1 + V 



Tr 



1-V 
1 + V 



(2.22) 



Here < .. >_a means an average with respect to the pure gauge action and Z is a renor- 
malization constant. We see how single instantons would give a nonzero contribution for 
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one flavor, but no contribution for more flavors, just as expected from the more formal 
continuum expressions. 

Note that the factor that appears traced is j^y rather than just j^y, but since the 
interesting regime is at eigenvalues of V close to —1 the difference can be absorbed in Z. 
Let us look at the difference between and j^y in the free case*: Write, for the free 
case, 

V = Z7 ^ + M (2.23) 
where and M can be function of the momenta p u . One gets then, 

This expression is close to the continuum in that it anticommutes with 75. M is chosen 
to be nonzero and positive at the location of all four-momenta which make = except 
the zero four-momentum point where M is negative. The elimination of the doublers is of 
the same type as first proposed by Rebbi [18]. Since j^y = j^y + 1 the former expression 
has a remnant of chiral symmetry of the form discussed some time ago by Ginsparg and 
Wilson [19]. 

To be sure, neither expression appears to be a complete and unique replacement for 
the massless continuum fermion propagator (in the presence of gauge fields). Actually, 
it is not certain that such an object really exists or is at all necessary. Of course, there 
does exist a full fermion propagator, including all heavy fields in addition to the physical 
field. This propagator can be obtained directly from the D matrix. However, if we wish 
the propagator for, say the chiral components at s = 1 and s = 2k all other fermions 
need to be integrated over, just like in [2]. The relevant expression could be obtained by 
using appendix A to couple external sources to the desired bilinears, and differentiating 
subsequently the exact formulae for the determinants with respect to the sources. From the 
expressions we derived until now we learn that for the purpose of computing the effective 
action we can use as a P r °P a gator of the single physically interesting fermion, while, 
for the < (^V')phys > condensate, the role of the physical propagator factor is played by 

— 1, although, the determinant factor is still det 

Supersymmetric theories with no chiral matter contain fermions in real representations 
of the gauge group. The simplest case is theories with no matter at all. As emphasized 
by Curci and Veneziano [20] supersymmetry should be restored in the continuum limit if 



* This is an observation made by Ting-Wai Chiu in a private communication to the 
author. 
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enough ordinary symmetries are preserved on the lattice. Thus, as noted in [4], the overlap 
could be used for supersymmetric theories. Here we wish to present simplified formulae of 
the type in [1] for the effective actions for the supersymmetric case, both in the truncated 
models and in the overlap limit. For other work with Majorana fermions see [21]. 

In the supersymmetric case of pure gauge we have fermions in a real representation 
so the Ufj, matrices are real. Let us restrict our attention to d = 4 for definiteness. What 
we need then is a square root of the determinant that is analytic in the link variables. 
The analyticity in the link variables assures that no unwanted new terms would come 
into Ward identities involving the link matrices: If, for example, we just took the square 
root of the absolute value of the determinant, in the vicinity of configuration for which 
the determinant vanishes we could generate delta-functions of variables if we take 
derivatives with respect to a sufficient number of times. Such terms may spoil a 
smooth approach to the continuum limit, by refusing to disappear. Following [22] we are 
looking for a decoupling of the fermionic functional integral in the case of Dirac fermions 
in a real representation of the gauge group. Such a decoupling is achieved if one finds a 
basis in which the matrix still has entries analytic in the but where it is antisymmetric. 
The analytic square root is then the pfaffian. 

We first make a simple basis change, reversing the order of the rows of q x q blocks in 
the matrix D{jj). 



D r (n) 



Clearly, 
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det D r (ii) = (-) kq detD(n). 



(2.25) 



(2.26) 



Because of the reality of the representation, U^(x) = U*(x) for all \x and x. The B 
block is therefore real, while the C block obeys 

d 

C = J2 a » W ^ ( 2 - 27 ) 
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where = — Wj = W* with W T meaning the transpose of W. In four dimensions we 
choose or 4 = i and cr 12 ,3 as the standard Pauli matrices. Therefore a 2 a il o 2 = — cr*. We 
learn that 

a 2 C T a 2 = - C f 

dT (2-28) 
o 2 Bo 2 =B . 

Define a block diagonal 2k x 2/c matrix E 2 with qxq blocks given by a 2 on spinorial indices 
and unity on all other indices. The matrix we are interested in in the supersymmetric case 
is £>susy(aO = ^ 2 D r (fi), 



£>susy(» 



Here, 



Since det a 2 = —1, 
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B a = a 2 B = 


-B° T . 





(2.29) 



detD(p) = (-) k i detD SVSY (p). 



(2.30) 



(2.31) 



Since -Dsusy (aO is antisymmetric, we obtain, for the supersymmetric case, the analytic 
square roots pf [-Dsusy (/•*)]• This implies that the ratio of determinants in the subtracted 
system also admits an analytic square root, and the effective action in the "truncated" 
supersymmetric case is: 



± 



Pf [Asusy Q/ = 0)] 



pf[D SVSY (p = l)] \ 



det 



l + r d+1 tanh(|#) 



(2.32) 



There is an overall sign on the left hand side that we have not determined; this is not 
important since this sign is independent of the gauge field because on the right hand side 
we have an expression that will not vanish for any set of links U^. (The single way the 
sign on the left hand side could depend on the gauge field would be for the expression 
to vanish when one gauge field configuration is deformed into another.) The reason the 
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expression under the square root on the right hand side cannot vanish is the tanh term, 
which ensures that the operator T d+1 tanh(|.ff) has norm less than one for any finite k. 
This is just another way to see that the would be gluinos have a finite positive mass when 
k is finite. Thus, we shall not have exact supersymmetry for any finite k in the continuum 
limit. 

When k is taken to infinity the norm becomes one and zeros become possible (and 
actually expected to occur in instanton backgrounds). But, since sign changes as a function 
of the gauge background were not possible for any finite k, sign changes are prohibited 
also in the infinite k limit. This is in agreement with the expectation from continuum [23]. 
Therefore, we expect a supersymmetric theory in the continuum limit, when the gauge 
coupling (3 is taken to infinity and the mass parameter Mq is kept anywhere within the 
finite interval (0, 1). In practice, one will be working at a finite (3 and then the range for 
M is different, as sketched in the introduction. Also, unless some new simulation trick is 
discovered, we would be working at a finite k, large enough that the effects of finite gluino 
mass are negligible when compared to other sources of statistical and systematical errors. 

The left hand side of the above equation tells us that the expression is not only not 
changing sign but also is analytic (more precisely, for finite lattices and finite k it is a ratio 
of polynomials in the link variables). 

In the massless limit (k = oo) and going over to continuous heavy flavor space we 
obtain the following compact expression for the fermionic determinant on the lattice (the 
equality holds up to an irrelevant overall sign): 



1 + T d+1 e(H>) 



'det 



= pf 



<?2 






&2 



+ ^SUSY 



(2.33) 



The unitary and antisymmetric matrix Vsusy is given by: 

1 



^susy = y 



WW 



— (J2O 1 (72-D J 



(2.34) 



The above is a direct generalization of results in [24] . 

Lattice approaches to supersymmetry in different contexts have been discussed also 
in [22, 25, 26, 27]. Some of these papers employ ordinary Wilson fermions and there the 
issue of positivity on the lattice (versus the one in the continuum [23]) becomes somewhat 
murky.* 



* I thank Istvan Montvay for an e-mail discussion of this point. 
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In our equations the link matrices where taken in the adjoint representation. In the 
case of SU(n c ) these {n 2 c — 1) x (n 2 c — 1) matrices can be expressed in terms of the n c x n c 
link matrices in the fundamental representation as is well known from elementary group 
theory. Such a representation is natural if one keeps the bosonic gauge part in terms 
of link matrices in the fundamental. There is no reason to do that for supersymmetric 
theories and it is more natural to take the Wilson plaquette term also in the adjoint. 
If this is done, lattice gauge field configurations that approximate nontrivial continuum 
SU{n c ) / Z(n c ) bundles on tori can be dynamically generated. This may be important in 
the continuum limit [28], in particular if we view the approach towards the continuum 
limit as taking place on a four-torus of fixed physical size. 



3. Mass Matrix 

In this section we deal with the mass matrix in the absence of the parameter \x. Our 
main objective is to see that indeed one gets one light quark and k — 1 heavy ones. Actually, 
what we are really interested in, is to identify the mechanism that is capable of preserving 
the large hierarchy between the heavy fermions and the light one. Phrasing it this way, 
we see that our problem is very similar to well known problems faced in Particle Physics. 
It should come as no surprise then, that we shall conclude that so called domain wall 
fermions in the truncated context are nothing new. On the other hand, by identifying the 
mechanism that preserves the hierarchy at finite k, we shall gain confidence in this way of 
regularizing almost massless quarks on the lattice. Moreover, usage of standard Particle 
Physics concepts is usually advantageous in lattice work. 

Some workers [5] seem to believe that the overlap is somehow exclusively restricted to 
chiral gauge theories. Logically this is almost impossible, and, indeed, a significant part of 
section 9 in [4] was devoted to vector-like theories. It was shown there that for Nf massless 
flavors an SU(Nf)u x SU(Nf)L x U(1)r+l is preserved exactly at finite lattice spacings, 
thus, not only exhibiting exact chiral symmetries (evidence for exact masslessness of the 
quarks), but also the explicit breaking of U(1)r-l induced by gauge topology. 

The heart of the mechanism that preserves the zero mass of the quarks on the lattice 
was identified in [2]: In the infinite internal space the mass matrix has a nontrivial index 
(associated with a quantum mechanical supersymmetry in heavy flavor space) which must 
be stable under small but finite radiative corrections (there is a finite ultraviolet cutoff). 
Essentially, there exist well localized states in the internal space which represent unremov- 
able zero modes and create the left and right components of the massless quark. When 
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flavor space is truncated the index is lost and the massless quark acquires a small mass. 
The question is what keeps it small once gauge interactions are turned on, and the index 
is absent. 

Clearly, when the internal space is very large, but not infinite, one would expect the 
mechanisms that do the job for the untruncated situation to be still at work and prevent 
the mass of the light quark from becoming too heavy. Otherwise, an unlikely discontinuity 
in the mass spectrum would occur as internal space expanded to infinity. It was shown in 
[2] that fermion propagators in internal space were exponentially decaying in the infinite 
directions and therefore perturbation theory could not generate such a discontinuity. Going 
from infinite internal space to finite internal space changes the propagator somewhat [9] 
but nothing much can happen given the exponential decays mentioned above. To obtain 
the effects of the truncation on the infinite flavor space propagator of [2] one could use the 
method of images. 

So, it looks plausible that a large hierarchy can be preserved in the truncated case. 
But, once we have discrete flavor and truncated flavor space, it seems absurd to think 
about flavor as a space approximating something infinite. In this context one often finds 
statements about the truncated model describing it as a domain wall fermion model with 
exact chiral symmetries in the limit of an infinite distance between the walls. This creates 
an illusion that the number of flavors is just a parameter like a mass, and we can take 
it at will wherever we want. Clearly, the number of fields in any model is not a usual 
parameter and taking it to infinity is not a simple procedure. So, then, what would be a 
better way to think about the truncated model ? The answer is simple: it is a model with 
a special mass matrix designed to preserve a large mass hierarchy. It should be possible 
to understand how it works without referring to infinite flavor limits. Once we gain this 
understanding we can trust the mechanism to work on the lattice. 

We shall look at the mass hierarchy from three new points of view: The first employs 
the theory of orthogonal polynomials and is somewhat mathematical. The second shows a 
connection to the well known see-saw mechanism [29] . The third establishes a link to the 
Froggatt-Nielsen [30] mechanism. 

The mass matrix M is made up of all terms coupling left to right handed fermions in 
D. By permuting indices, one brings D into the more convenient form below: 



Here C, are of size kq x kq with trivial action in flavor space. M is a truncation of the 




(3.1) 
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infinite mass matrix of [2]. 



( B 



M = 




B 
-1 B 



\ 


















(3.2) 



-1 B) 



Let us consider diagonal B matrices (free fields in momentum basis) and compute the bare 
masses. We introduce new notation, replacing the blocks by numbers. This is permissible 
since the blocks are diagonal now. 



(b 

a b 
a 



M 






b 



0\ 





(3.3) 



\0 ... a bj 
The mass eigenvalues are extracted from M^M. 

M^M = (a 2 + b 2 )l + 2ab( J - ?-N k ). 



(3.4) 



J is a truncated Jacobi matrix. The full Jacobi matrix is tri-diagonal and generates 
a sequence of orthogonal polynomials. In this case the orthogonal polynomials are the 
Chebyshev polynomials of second kind. The matrix Nk contains information about the 
truncation. Nk has all entries vanishing except the (k,k) entry which is equal to \. 
Similarly, we define a matrix N\ whose single nonzero entry is at (1,1) where it is again 
equal to \. MM^ obeys a similar formula as above, only Nk is replaced by Ni. J is defined 
by: 



2J = 



1 



Vi 



l 
o 
l 



o 
l 





\ 



(3.5) 



M is not diagonalizable: 



[M\M] = 2a 2 (N 1 -N k ). 



(3.6) 



When k is taken to infinity and one requires square summability over flavors, the 
term Nk in the above equation gets replaced by zero. From the point of view of the 
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light degrees of freedom this amounts to keeping only one of the chiral components. By 
relabeling backwards one sees that it is possible to keep only the other chiral component. 
The infinite k limits we took on the determinants before keep both components. 

Suppose only one chiral component is kept: For example, assume that has a 
zero mode |0 >, while M has no zero mode. There is a nontrivial index and we have 
| < 0|[Mt,M]|0 > | > 0. We conclude that in order to keep only one chiral component, 
it is necessary (but not sufficient), that tr[M\ M] = ±2a 2 7^ in the limit. Since for 
any finite k, tr[M^ 7 M] = 0, these limits are not smooth and require the apparatus of 
operator transfer matrices to give them a proper interpretation. Here, we are satisfied 
keeping tr[M\ M] = even at infinite k and there are no subtleties in taking the limit. 

The overall scale of M does not matter, so the single parameter we have is w = |. 
Let e^) be the j th eigenvector of M^M. 



e U) = 



U 2 



(J-wN k )e {j) = A je (j) . (3.7) 



\UkJ 

The equations satisfied by the components Ui, with uq = and Uk+i = —wiik, 

ui + us =2\u 2 



(3.? 

Uj-i + Mj+i =2Xuj 



M fe _i + u k+1 =2Xu k 

are the recursion relations for the Chebyshev polynomials, and the initial condition selects 
the Chebyshev polynomials of second kind. 

. . . , ... sin(z6*) / sinh(i60\ 
Ui oc Ui (cosO oc Ui(cosh9)) = = . 3.9 

sin(fc') \ smh(0) / 

The argument 9 is determined by the k th equation which is equivalent to 

U k+1 (X)+wU k (X) = 0. (3.10) 

Let Xj be one of the roots of the above equation. Then, the eigenvalue of (J — wN k ) 
associated with the (j) eigenvector is cos(0) = Xj (cosh(0) = Xj). 
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As A varies between —1 and 1 the ratio %^p^ S oes through k zeros and k — 1 poles. 
This implies at least k — 1 solutions for any iu. The single question remains whether a /c th 
solutions fits into the interval. This depends on the value of w and the values of the ratio 
at the end points. It is important to realize that, as long as the Jacobi matrix generates 
orthogonal polynomials with a positive measure the structure will be the same. Our case 
is particularly simple, so we can be more explicit, but the structure would hold for much 
more general mass matrices. Explicitly, we have two cases: 

i \w\ < 1 + \: all roots are in [—1, 1]. 

ii \w\ > 1 + \: there are k — 1 roots in (—1,1) and one root outside this interval. The 
outside root has the opposite sign of w. For large k, due to the rapid growth of the 
Uk polynomials outside the [—1,1] interval we have, up to exponential corrections (in 

k), 

Xk^'liw + w- 1 ). (3.11) 

These eigenvalues determine directly the eigenvalues of M^M, m 2 RL . Thus, for \t\> 
1 + j: we have k — 1 masses obeying (a ± b) 2 < m 2 RL < (a =F 6) 2 , i.e. typically, they are 
all bounded away from zero. There is one mass outside the interval, which, up to relative 
corrections exponentially small in k is given by 



m l L = T^k+2(a 2 -b 2 ) 2 , (3.12) 

If If I < 1 + t all k masses are bounded away from zero by the smaller among (a ± b) 2 . 
The above equation for m 2 RL is derived by setting \w\ = e B (l + 5) (6 > 0) and computing 
5 to leading order in In the free case, at zero momentum, b = M and a = — 1, so 

that 

m 2 RL = \M \ 2k (l - |M | 2 ) 2 (1 + O(\M \ 2k )). (3.13) 

The above formula was first derived in [7]. To be precise, we really only computed at 
strictly zero momentum, so (3.13) also contains a wave function renormalization constant. 
But, from (2.22) we can infer that the latter has a finite large k limit. 

The theory of orthogonal polynomials indicates a certain robustness in the above 
structure. Let us focus on M^M. Assume it is a moderate deformation of the above 
very simple structure. As a first step apply Lanczos tri-diagonalization starting from the 
eigenvector corresponding to the almost massless state in the unperturbed case. As long 
as the deformation is small, the positivity of the measure associated with the tri-diagonal 
matrix (viewed as a Jacobi matrix) will be maintained. It is important to understand that 
we think for the time being about a tri-diagonal matrix for any k, in effect and infinite one, 
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with well defined sequences determining its entries. The truncation only enters through 
the matrix N k . Therefore, the robustness of the matrix J is on the same footing as the 
robustness of the exactly massless eigenvalue in the infinite flavor case, where the index 
(sometimes called the "deficit" in older orthogonal polynomials literature) is active. But, 
given this robustness, the effects of the truncation follow simply from the most basic 
properties of any set of orthogonal polynomials. 

We conclude therefore that using the theory of orthogonal polynomials we can convince 
ourselves that for perturbations under which the infinite flavor problem is stable there will 
be two regimes depending on a parameter w: One will have k — 1 massive fermions and 
one very light fermion (for finite but large k). The complementary regime will have k 
heavy fermions. In short, the robustness of the infinite flavor limit implies robustness of 
the hierarchy for large but finite numbers of flavors. 

It should be mentioned here that the entrance of orthogonal polynomials can be un- 
derstood also as follows: At k = oo there is an infinite M^M and the (positive) mass 
square spectrum has a disjoint support in two components. The first component is a 5- 
function at the origin and the second is a segment supporting a continuum with typical 
square root spectral singularities at both ends of the segment. If heavy flavor space is 
continuous in addition to being infinite the upper bound on the spectrum disappears [2] 
and the segment gets replaced by a semi-infinite line. With discrete flavor we can truncate 
to finite integer k and the above spectral weight gets approximated by a discrete set of 
abscissas and associated weights. The 5 function is replaced by one abscissa and shifts 
away from the origin by a small amount. The truncation amounts just to the well known 
Gaussian quadrature we are familiar with from numerical analysis, where the formulae are 
indeed derived using the theory of orthogonal polynomials. 

The above discussion about the mass matrix is somewhat more general than the one 
in [9] and makes no reference to specific properties of the propagator. Nevertheless, it is 
not more illuminating because it is a bit mathematical and does not tell us much about 
not too large /c's, which is, after all, what matters in practice. 

To really understand what goes on for moderate k consider the light mass formula 
for k = 2 and a 2 >> b 2 . Every Particle physicist would be immediately reminded of the 
see-saw mass formula [29] . Therefore, there should exist a more familiar way to understand 
the hierarchy and its robustness. For this we need one more basis change. 

Let us rotate the right handed fermions alone with the intention to make M hermitian. 
Then we can diagonalize M itself, not only M^M or MML A simple relabeling of fields 
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yields: 

/O ... b\ 
... b a 



M = 



... b a 



(3.14) 



\b a ... 0/ 
Assume now that |6| << \a\. We work to leading order in the ratio. First set 6 = 0. 
M 2 is then diagonal and has k — 1 eigenvalues equal to |a| 2 and one eigenvalue equal to 
zero. The nonzero eigenvalues will not be affected much by turning on b. We know their 
leading behavior already. It remains to find out what happens to the lightest mode. Note 
that: 

|detM| = |6|\ (3.15) 

Therefore, 

m2fc 

™ 2 RL = | a |2(fc-l) ( 3 - 16 ) 

to leading order in j^j. The expression is consistent with the previous computation at 
large k and moderate j^j. So, if the ratio of the entries is small, for any finite k we get an 
amplification of this ratio resulting in a large hierarchy. This is the see-saw mechanism. 
It is obvious now that the above considerations would hold if we replaced the constant 
entries b by varying entries bi, and similarly the entries a by Oi, as long as the orders of 
magnitude stay the same. We shall still obtain the same rough hierarchy. 

However, if we add a non-zero entry in the (1, 1) corner of our hermitian M, there is 
a dramatic change, the determinant of M going from being very small to being of order 
a . The light state is completely lost. What promises us that any such entry of M, 
when generated radiatively, will be so small that it will preserve the hierarchy ? Clearly, 
there must be some way to understand the orders of magnitude of all entries of M that can 
be radiatively generated. Some approximate conservation law must protect these entries 
from growing too much as they move away from the main off-diagonal. The mechanism 
that does that has been invented a long time ago by Froggatt and Nielsen [30].* 

The F-N mechanism, can be made to work as follows in our context: In the original 
expression for D replace each S-entry above the diagonal by (ycf))B and each S-entry 
below the diagonal by (y<p)*B. <fi is a unit charged scalar Higgs field. The unit charge is 
with respect to a new abelian group which is spontaneously broken. The fermions are also 
charged under this group; we denote this new charge by Fn. We pick an assignment of 
charges for the fermions as defined in figure 1 for k = 5. 



I thank Y. Kikukawa for reminding me of this mechanism. 
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F-N charge assignments for k=5 
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Figure 4 Fjy Charge assignments of the $ s fields for s = 1, 2, 2k. Connecting lines 
are there just to guide the eye. 

The intuitive picture is best described by first simplifying and ignoring all doublers: 
For some reasonably weak y (not fine tuned, but small) in the presence of spontaneous F^ 
symmetry breakdown, the lattice fermion spectrum can be viewed, in continuum language, 
as follows: There are k — 1 massive Dirac fermions, all having masses of the same order 
(the condensate is of the order of the lattice cutoff). All these Dirac fermions realize the 
Fn (*7(1) symmetry vectorially. There also is one light Dirac fermion whose left and right 
components carry very different F^ charges, thus realizing the £7(1) symmetry chirally. 
These two components can only communicate, for perturbative y, via k "exchanges" with 
the condensate. All intermediate fermions in the diagram are massive. If the ratio between 
the condensate scale < <p > and the heavy mass is sufficiently small relative to unity (but 
not necessarily unnaturally so) we get our hierarchy. 

But, we shouldn't ignore the doublers. They cannot be ignored because the Fn 
1*7(1) is an exact symmetry on the lattice, and therefore, in a continuum limit that also 
maintains the Higgs Geld, it must be realized in a vectorial way to avoid anomalies. On 
the other hand, as long as we do not keep the Higgs field in the continuum limit and 
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are in the spontaneously broken phase, the intuitive picture would lead to a selfconsistent 
perturbation theory. All we really want from this picture is an intuitive reason for why, 
when we calculate, we expect to get a mass hierarchy, and why radiative corrections should 
not be suspected to immediately destroy it. 

More formally, we can eliminate the Higgs field and replace it by a constant and 
just observe that making an global gauge transformation on all the fermions with 
parameter x an d, simultaneously, changing y by the phase e %x leaves the theory invariant. 
Therefore, the two point function involving the left and right components of the quark at 
small momenta which would become massless if y were set to zero, must have a dependence 
on y of the form y k fk{\y\ 2 ', A). A represents the gauge background. By a finite rescaling 
of y we can arrange for fk(0;A) to be of order unity. If the rescaled y is only reasonably 
small (say of order .5), an exponential mass hierarchy gets generated for large fc's. 

There is one problem we have not directly addressed yet, but we must. The above 
arguments indicate that the light quark will stay light when gauge interactions are turned 
on. They also might be taken, in particular in the domain wall picture [10], to indicate that 
the left and right components of the light quark completely decouple. If this is true they 
can be rotated independently, and then we have an inescapable U(1)l-r problem! Since 
we know that this global symmetry is explicitly broken, the arguments cannot be com- 
pletely correct, and if they aren't, how can we trust them to correctly indicate approximate 
masslessness? 

The answer is known at the mathematical level: In the overlap the U(l) problem 
is solved in exactly the way discovered by 't Hooft. The independent rotations on the 
decoupled components of the massless quark are not really rigorously defined in the path 
integral since k is strictly infinite. This creates a loophole just where needed in order 
to make 't Hooft's solution to the axial U(l) problem work on the lattice. Since the 
truncated model approaches the overlap, the mechanism preserving masslessness has to 
allow U(l) breaking just as needed. Moreover, from our exact expression for the fermion 
determinant in the presence of the /U-mass term we see explicitly that gauge configurations 
carrying nontrivial topology make large contributions. For example, for a single flavor, an 
instanton makes an unsuppressed contribution to < (V>V0phys >. 

In the intuitive picture based on the F-N mechanism, the role of nontrivial topology 
of the gauge background can be understood as follows: Suppose we have several copies 
of the whole setup, with an exact vectorial SU(Nf) symmetry. In the intuitive picture 
we described before the F N f7(l), being chiral, is also explicitly broken by instantons 
and the F N charge is not fully conserved. This lack of conservation makes it possible for 
a correlation function containing 2Nf chiral components of the light quarks to become 
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unprotected, and not necessarily small. Of course, on the lattice this picture is not really 
correct since the F N symmetry is not chiral. The single correct statement we can make is 
that we expect those functions fk(0', A) (or their appropriate generalizations for expectation 
values of products of more than two fermion fields) that get instanton contributions to be 
enhanced and avoid the suppression that generates the hierarchy. 

In some loose sense, it seems that the Smit-Swift [31] approach to use Yukawa cou- 
plings for obtaining chiral symmetries in the continuum might work for almost massless 
vector-like theories once it is combined with the F-N mechanism. The Yukawa couplings 
indeed can remove the doublers while keeping the "desired" fermion relatively light. The 
contribution of the F-N mechanism is to take a mass ratio between the doubler and the 
light fermion of order 2 say and amplify it to 2 fc . Since the heavy fermions have a mass 
of order 1 in lattice units, one gets a very light fermion. Thus, what Yukawa models were 
unable to do, even with fine tuning, namely generate some large mass hierarchies between 
charged (under the group that is gauged) fermions and other fermions, is no longer needed. 
Only some "start" in the right direction is needed and then F-N's mechanism can amplify 
the mass hierarchy as much as one wants. If one wishes strictly massless quarks one would 
need infinite amplification, so we are back in the overlap case. Therefore, if we wish to 
regulate a chiral gauge theory we cannot avoid an infinite number of fermions. But, in the 
vector like case, where very small quark masses are sufficient, the combination of the Smit- 
Swift and F-N mechanisms seems useful. So, despite some failures for chiral gauge theories 
[32] (even when combined with truncated fifth dimension models), a re-examination of 
Yukawa models, this time with the intention to provide a cleaner numerical approach to 
almost massless QCD, might produce something new and useful. One may also speculate 
that integrating out the heavy F-N Higgs fields, which would leave behind some multilinear 
fermion interactions, could provide a connection to the approach taken recently in [33], 
also with the purpose to get closer to the chiral limit at finite lattice spacings. It would be 
nice to unify the different tricks people have come up with while attempting to get global 
chiral symmetries respected by the lattice. 

Very recently [34] perturbative calculations in the gauge coupling constant have been 
undertaken to show robustness. Full details are not available at the moment, but the 
conclusion is that the hierarchy is maintained. This is expected, although the explicit 
check is definitely reassuring. But, we should keep in mind that perturbative checks of 
this kind will not be able to address the U(l) problem mentioned above, and therefore, an 
intuitive picture should be welcome. 
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4. Summary and Outlook 

In this paper we derived some exact expressions for the integrals over lattice fermions 
in systems that regularize vector like continuum theories containing very light fermions 
of Dirac or Majorana type. These expressions make explicit the effect of the truncation 
involved in going from the overlap where the fermions are strictly massless to actions 
that can be simulated numerically with relative ease. In addition to the Dirac case we 
also dealt with the Majorana fermions needed to simulate pure gauge supersymmetric 
theories. A criterion for the goodness of the truncation approximation was given in terms 
of certain extremal eigenvalues of the transfer matrix. Potential difficulties related to 
topology changes were noted. 

We also presented several different views that explain the stability of the large mass 
hierarchy between the lattice cutoff and the fermion mass. We found that two well known 
mechanisms, the see-saw and the Froggatt-Nielsen mechanism are alternatives to the extra 
dimension view. This understanding should contribute to our ability to trust and correctly 
interpret numerical results obtained using the truncated overlap. 

Considering the application of Renormalization Group ideas to the truncated overlap, 
we suggest that the blocking transformation should also integrate out some of the k heavy 
fermions.* Indeed, the pure glue correlation length increases exponentially with the gauge 
coupling (3 while the free fermion correlation length increases exponentially with k. In other 
words the thinning of degrees of freedom seems to be naturally applicable to both real and 
flavor spaces simultaneously. A flow in the k — (3 space could determine the anomalous 
dimension of the quark mass. It would be interesting to investigate this further. 

One way to translate our insights into the structure of the mass matrix into something 
practical is to devise ways to accelerate further the speed of hierarchy generation as a 
function of k. In four dimensions it appears that one can obtain useful numbers with 
k ~ 10 — 15 ([5]) while in two dimensions somewhat lower /c's might be adequate, although 
this is not completely clear ([6], [7]). The small effective mass goes essentially as (K(j3)) k 
where < k((3) < 1 is some unknown function. Any trick that reduces k((3) would allow 
a reduction of k without changing the effective mass. At (3 = oo k((3) is governed by the 
ratio of the B L — R term to the —1L — R term in the matrix D. We can make the ratio 

* Specifically: we already know from [35] that using 5-f unction constraints in the fermion 
path integral to implement the thinning of degrees of freedom is a bad idea, but if one uses 
smooth kernels for the same purpose no undesired non- localities get generated. We now 
imagine a kernel that not only implements the usual thinning but simultaneously reduces 
the flavor number k. 
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as small as we want at zero momentum, but there is an increase as we move away from the 
origin. A simple generalization of the action that could ameliorate the effect, and thus, 
hopefully, end up decreasing the required k, is as follows: 

Find two polynomials p(X) and q(X) which have the following properties: 

1. For A G [0,2d], p(X),q(X) > 0. 

2. There exists a real number < r < 2 such that, for A G (0, r), < << 1 while, 
for AG (2, 2d), 0<f$>l. 

Then replace the entries B in D by p(B ) and the entries — 1 by —q(B ). The truncated 
model we analyzed in this paper had p(X) = X+M and q(X) = 1. In addition to being small 
we would also like the ratio < to be relatively constant in (0, r) while it undergoes a 
rapid increase in [r, 2] to some value slightly higher than unity and stays above unity (but 
not necessarily with small variability) in the entire interval [2, 2d]. Increasing the degrees 
of the polynomials carries with it some computational cost, but, it seems that this cost 
could be compensated by needing a smaller /c-value. We hope to return to this issue in 
future work. 

Another way to put the viewpoints of this paper to some use is to reinterpret some of 
the data obtained by Blum and Soni [5] in the quenched approximation. They obtained a 
few values for the pion mass square at different /?'s, various fc's, with values of M chosen 
so that one works in the triangular area of figure 2. For their largest f3 value, they found 
that they needed to increase k from 10 to 14 in order to obtain apparent massless quarks in 
the limit \x = (masslessness was determined by extrapolation from finite \i values) . Since 
we view the system at finite k as having a quark mass coming from two sources, one the 
finiteness of k and the other /j,, the data at k = 10 extrapolated to \x = can be taken as 
an indirect measurement of the "F-N suppressed" quark mass at k = 10. In other words, 
the finite small pion mass is proportional to the sum of the truncation-induced quark mass 
and the explicit quark mass term. A test at a few smaller values of k might have provided 
numerical evidence for the exponential dependence on k. On the other hand it was found, 
at the highest (3 value, that the system behaved as if the truncation induced quark mass 
was negative. This was ascribed to a numerical detectable higher order term in the chiral 
effective Lagrangian (possibly with quenching effects included). A simpler possibility is 
that the truncation induced quark mass is indeed negative. We expect that when one 
works at parameters corresponding to the interior of the triangle of figure 2 to the right 
of the y-axis one induces a positive quark mass, but what happens more to the left is not 
completely understood. It is possible that, at (3 = oo the values —1 < M < induce, at 
finite k, finite negative quark masses (in the sense that includes the QCD 9 parameter). 
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The last conjecture may be a bit mystifying given the overt positivity of the regularized 
fermion determinant in (2.12) for example. So, let us clarify the issue: The basic question 
is whether the matrix B can also have negative eigenvalues for some gauge backgrounds. 
As long as B is positive definite we are sure that the fermion determinant is always positive, 
including for backgrounds containing a single instanton. Therefore, the quark mass must 
be positive. If B has a negative eigenvalue the basic formula T = e~ H , ostensibly defining 
a hermitian H, breaks down. Indeed the expression for T, which (unlike in the transfer 
matrix derivation of [3] where \[B was used at intermediary steps) is now valid as long as 
det B^O shows that T is positive definite iff B is such: 



All formulae containing H can be written in terms of T and are valid as long as det B ^ 0. 
With a T that can have negative eigenvalues the fermionic determinants do not have to 
be always positive any more. On the other hand, the structure of the mass matrix still 
indicates the presence of very light quarks. Therefore, we conjecture that to the left of 
the y-axis we are dealing with light quarks but with a negative mass. It is hoped that this 
point would be clarified in future work. 

Before closing, let us mention that the insights of this paper are hoped (as mentioned 
in [1]) to be of some help also to attempts to regularize chiral gauge theories. 
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Appendix 

The purpose of this appendix is to derive a general formula for the determinant of 
a block tri-diagonal matrix with non-vanishing corner blocks. This formula is analogous 
to one employed by Gibbs and followers in the numerical study of finite density effects in 




(4.1) 
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QCD [36]. Let the matrix D have even dimension n = 2k where each entry is a q x q block: 
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B 2 
C 2 
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B 3 A 3 

C3 -B4 
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n-1 



(Al) 



Moving the leftmost column of blocks into the place of the rightmost column we obtain 
the matrix D'\ 
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D' = 



V ... C n _i B n A n J 
Counting minus signs we obtain, using the evenness of n: 

detD = {-) q detD'. 

We now view D' as a k x k matrix with 2q x 2q blocks, defined below: 
Ai 



(A2) 



(A3) 



Oil 

Pi 



B 2 A 2 
C 2 B3 



, «2 



A3 
S 4 A 4 

c 5 



C 3 

In terms of the a and j3 blocks we have 
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We now consider the following linear equations for the unknown 2q x 2q matrices Vj,j = 
1, 2, . . . , k: 



D' 



( OL X 
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\ 




Z 1 
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-vi \ 


Pi 
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1 
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-v 2 





p2 


a 3 . 
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-v 3 


V o 
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. . Pk-l Oik) 




u 





. 


. 


1-VkJ 



More explicitly, the equations for the Vj are: 

ai vi + Pt = 
a 2 v 2 + Pi vi = 



ttfc Vk + Pk-i Vk-i = 



(A6) 



(A.7) 



(A.8) 



(A.9) 



With vj solving the above equations, we obtain: 

k-i 

det_D' = det ctj] det(ctfc — oikVu)- 

The single unknown is Vk, and the solution is given below: 

otkVk = (-)* ' Pk-iak-\Pk-2ak-2 • • • P2&2 1 Pio-i 1 Pk- 

We obtain, finally, 

fc-i 

detD = {-) q [J] detay]det[a fc + (-/? fe _ia^(-/? fe _ 2 c^ 2 ) . . . (-/fcaj ^(-fta^ 1 )/^. 
i=i 

(A10) 

We chose a form well suited to our applications. The above form can be used to derive 
propagators by varying with respect to various entries of D. 
Our main applications have: 



Pj 



B 
-C - 

-1 ct 



<Xj = \ _ c _ 1 I = a, j = 1 , . . . , — 1 a/- — . _ c , ^ 



B 



(All) 
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We shall also need 



In this case we obtain: 



a 







j_ 

B , 

-C± -1 



detD = (-) 9 (detS) fc - 1 (-)' ? ^- 1) det[a fc - ai-a' 1 (3) k (3' 1 (3 k 
The product —a _1 f3 is related to the transfer matrix T: 

-a' 1 13 = 



(A.12) 



(A.13) 



B 1 i B + 



r 



l\ T -i/0 1 
10/ I 1 



Finally, we can write: 

detD = (-^-^(detBfdet 



-X 
1 



1 
-Y 



{AAA) 



(A.15) 
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